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Abstract 

We study the problem of translating updates of database views. We disambiguate a view 
update by requiring that a specified view complement (i.e. a second view which contains all the 
database information omitted from the given view) remains constant during the translation. We 
study some of the computational problems related to the application of this general methodology 
in the context of relational databases. 

We consider, for the most part, databases consisting of a single relation, with functional 
dependencies as the only integrity constraints; we also restrict our attention to views defined by 
projections. We first give a characterization of complementary views (valid also in the presence 
of join dependencies), which leads to efficient algorithms for checking if two given views are 
complementary and for determining a non-redundant complement of a given view. We also 
show that the problem of finding a minimum complement of a given view is AT-complete. 

We then study in detail the problem of translating the insertion of a tuple into a view. We 
show how to do the translation in case the insertion is translatable, and we also develop a 
polynomial time algorithm for testing translatability; we also give two stronger, more efficient 
translatability tests. We show lower bounds for the complexity of the translatability problem, by 
proving that it becomes n/-hard if the view is given in an exponentially succinct way; an 



analogous result is shown for one of the stronger tests. We also examine the problem of 
determining a complement which renders a given insertion translatable; we find that it can be 
solved in time polynomial in the view, but becomes NP-hard if the view is given in an 
exponentially succinct way; again, analogous results are valid for the stronger tests. 

The above results are extended, in a straightforward way, to the cases of deletion and 
replacement of a tuple. Finally, we define and study a new kind of functional dependencies 
which is important in the context of complements, the explicit functional dependencies (EFD's), 
which intuitively state that some part of the database information can be computed from the 
rest We examine the interaction of EFD's with functional dependencies and join dependencies, 
and we also extend our characterization of complementary views to allow for the presence of 
EFD's. 
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1. Introduction 

/./. The Problem and the Approach 

In database systems, the amount and structure of the stored data is decided by the database 
administrator. However, individual users often want to deal with only part of the information in 
the database, and moreover they may want to restructure it in a way suitable to their needs. For 
this reason, database systems often provide the view facility. A view is defined by giving a query 
on the whole database. At any point, the contents of the view is just the outcome of this query. 
The user queries and updates the view as though it were a database in itself, with no reference 
to the underlying database. The view idea spares the user from the conceptual complexities of 
the whole database, makes queries easier by "factoring out" a common subexpression, and can 
serve as a protection mechanism by restricting access to only insensitive information. A view 
facility is an important part of many relational database systems, e.g. PRTV [T], QBE [Zl], 
System R [As] and INGRES [SWKH] (as well as of database systems designed along the lines of 
the network data model, like DBTG [CO], or the hierarchical data model, like IMS [D, I]). 

In relational database systems, a view is in general implemented by naming and storing its 
definition, which is just a query definition in the query language of the system. Queries on the 
view are translated into database queries by composing them with the view definition. Thus, 
querying a view presents no serious conceptual problems. 

What is much more complex is the subject of updating a view. A simple update operation, 
such as inserting a tuple in the view, may create formidable problems. The underlying database 
update may be ambiguous, ill-defined, create inconsistencies in the database, or have side-effects 
on the view. This problem seems to be related to such fundamental issues as null values [Co4, 
Za2] and update anomalies [Col, Co3, BBG] in relational databases. Most existing systems do not 
allow updates of views (e.g. PRTV, QBE), or allow them only in the trivial case in which the 
view consists of one of the database relations. This omission apparently reflects our poor 
understanding of the subject 



In one of the first works dealing with view updates, Dayal and Bernstein [DB] stipulated a 
notion of correct translation of a view update, and gave some straightforward conditions for the 
existence of such translations . From this and subsequent works, e.g. [RS, Ca, FSD], it became 
apparent that we need a method for assigning semantics to view updates. This method should be 
formal (resolving the delicate ambiguities involved) and simple (so the users would define the 
semantics themselves, perhaps with the aid of the query system). 

An excellent solution to this problem was suggested in the work of Bancilhon and Spyratos 
[BS, Sp]. They developed an elegant theory (quite independent of the relational model) of 
database mappings, i.e. functions from database states to database states. A view v is such a 
mapping, and so is an update u on the view. How can we translate u? The translation, T w must 
be such that the updated database maps via v into the updated view. As may be suspected, there 
are typically many T u \ so the problem remains. Bancilhon and Spyratos resolve this ambiguity 
by the notion of the complement of a view. A complement of v is another view v\ such 
that the mapping s-+(\(s), v'(s)) (where s denotes the database state) is one-to-one. In other 
words, any information lost by v can be recovered by v. A view has many complements (for 
example, the identity mapping is a complement of all views). Choosing a complement that must 
remain constant assigns unambiguous semantics to a view update. The scenario is the following: A 
user defines a view. Before updating the view, the user must define (probably with the assistance 
of the system) another view (a complement of the first), which must be held constant during 
updating (this corresponds to the "rectangle rule" of [Ch] and the "absence of side-effects" of 
[DB]). Using this information, the system translates (or rejects as untranslatable) the user's 
updates. 

Translating under constant complement amounts to finding a database state s such that 
v(s')=uv(s) and v'(s')= v'(s). By the definition of a view complement, s will be unique if it exists 
at all. Thus, if such an s can be found for any s (in which case we say that u is v- 
translatable), we can translate u as the database update 7' u =(vXvT 1 (wvXv'). The soundness of 
the overall approach is demonstrated by the following facts [BS]: 

i) T u is consistent, i.e. the updated database always maps, under the view definition v, on 



the updated view (formally vT u = uv ); also T u is acceptable, meaning that if u does not change 
the view, no change is made on the database either (i.e. for all s, uv(s)=v(s) implies TJs)^). 

ii) Suppose U is a set of view updates which is reasonable in the sense that it satisfies 
minimal user requirements, i.e. it is closed under composition and there is a means to cancel the 
effect of every allowed update on the view (formally, if u, w £U then uw £U, and if s is a 
database state and u£U, there is an update w£U such that wuv(s)=v(s)). If v' is a view 
complement such that any update in U is v'-translatable, then the mapping which associates to 
an update u in U the database update T u is a morphism, i.e. T UW =T U T W for all u, w €U 
(clearly, any reasonable way to translate a set of updates should have this property, i.e. the result 
of the translation should be the same whether the user applies two updates from the set one 
after the other or their composite update). On the other hand, the converse also holds: if T is a 
mapping on U such that, for every u€U, T(u) is a consistent and acceptable database update, 
and also T is a morphism (i.e. T is a reasonable way to translate view updates into database 
updates), then there is a view complement v' such that, for every u€U, u is v'-translatable and 
T(u)=T u . 

However, as was pointed out earlier, this approach is essentially independent of any 
particular data model. In this work we investigate some of the issues and problems which arise 
when one attempts to apply this methodology in the context of the relational model, with a view 
towards rendering it realizable in practice. In the remaining part of the Introduction we review 
briefly some basic concepts and notations of the relational model, and we also give an overview 
of the results obtained. 

1.2. The Relational Model - Basic Definitions 

The relational model [Col] assumes that the data are stored in two-dimensional tables called 
relations. The columns of a table correspond to attributes, and the rows to tuples (records). Each 
attribute A has an associated domain of values D^ y and a tuple is viewed as a mapping from the 
attributes to their domains. We use the letters A, B, C, ... to denote attributes and the letters ... , 
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X, Y, Z to denote sets of attributes. A relation scheme is a finite set of attributes labeling the 
columns of a table, and is usually written as a string of attributes. If X is a relation scheme 
labeling the columns of a relation R, we say that R is defined over X. A database scheme D is a 
finite set of relation schemes. A database over D is a set of relations containing exactly one 
relation over each relation scheme in D. 

In the context of the relational model, one way of formulating queries (and thus of defining 
views also) is by using a set of operators defined on relations (relational algebra [Col, Co2]). The 
operators that will be of primary interest to us are projection and join. The projection of a tuple 
t to a set of attributes X, written i[X], is simply the restriction of / to X; the projection of a 
relation R to X, written vj^R), is the set of projections of the tuples in R to X. If R /, R 2 are 
relations defined over the relation schemes Xj, X 2 respectively, the join of Rj and Rp written 
R]*R} is the relation over X } UX 2 consisting of all the tuples p such that plXfltRj, n[X 2 ]€R 2 . 

Semantic information is captured by means of integrity constraints (i.e. predicates on 
relations), usually expressed as first-order sentences called dependencies [Col]. Various kinds of 
dependencies have been defined and studied in the literature; we will be primarily concerned 
with junctional dependencies and multivalued dependencies (the latter are a special case of join 
dependencies). 

A junctional dependency (FD) [At, Col] is a statement of the form X-* Y, where both X and 
Y are sets of attributes. The FD X-> Y holds in a relation R if for all tuples /i and v of R, if 
^[X]=p[X], then mM^M 

A join dependency (JD) [R2] is a statement of the form *[/?/, ... , RJ, where each R t is a 
relation scheme. The JD *[Rj, ... , RJ holds in a relation R if * v R (R)=R. A multivalued 
dependency (MVD) [F, Zal] is a JD with at most two relation schemes. An MVD *[R } , R$ is 
also written as RjnR 2 -^-*Rj (or equivalently fyn/^-*-*^)- 

Finite sets of dependencies will be denoted by 2. A database schema S is a pair (D, 2), 
where D is a database scheme. An instance (Le. a database DB over D) which satisfies the 
dependencies in 2 is called legal (notation: £>5N2). 
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For a more detailed exposition of the fundamendal notions and notations of the relational 
model and of the relevant theory, see [Ul]. 

1.3. Outline of the Thesis 

This work is mainly centered around the study of the computational problems arising when 
one attempts to apply the general methodology proposed by Bancilhon and Spyratos in the 
context of the relational model. We discover that very interesting theoretical questions already 
arise at very simple cases of the application. In particular, we concentrate on database schemas 
consisting of a single relation, with integrity constraints which are (for the most part) just 
Junctional dependencies. The views we consider are simply projections of the relation. Working 
with a single relation corresponds to some unrealistic universal relation assumption [U2], but it 
yields a simplified problem which must be conquered first. Functional dependencies constitute a 
simple and practical class of constraints. Projective views are, again, the simplest imaginable, and 
they are also important from a practical point of view. 

In Section 2 we characterize when two projections are complements of each other. There is 
an interesting parallel between this characterization and the notion of independence of Rissanen 
[Rl]. Our necessary and sufficient condition (which can be generalized to include the presence of 
join dependencies) states that the common part of the projections must be a superkey of one of 
the projections. As a consequence, it is easy to test whether two given projections are 
complementary in a schema. It is also possible to construct a nonredundant (minimal) 
complement of a given projection in polynomial time. Unfortunately, finding a smallest (i.e. with 
fewest attributes) complement of a given projection is shown to be iVf-complete. 

In Section 3 we study how to implement the insertion of a tuple into a projection, keeping 
a given complementary projection unchanged. We show that this can be done in a unique way, 
and so the problem reduces to testing whether the resulting database is consistent We show that 
this test can be carried out in time cubic in the number of tuples of the view. Since this is likely 
to be impractical, we also develop two alternative stronger tests that can be executed more 
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efficiently. 

Ideally, we would like the time complexity of our update algorithms to depend on the 
number of attributes, functional dependencies, and other parameters of the schema, not of the 
instance. When the time must depend on the number of tuples, we would at least like this 
dependence to be logarithmic, since this number is expected to be very large. However, 
complexities like those described in the previous paragraph resemble, in a practical sense, 
exponential complexities. We show some negative complexity results which suggest that this 
"exponential" behavior is inherent: The translatability problem becomes n/-hard [St] if the 
view is represented in some exponentially succinct way (e.g., as the union of two Cartesian 
products). Even one of the simpler, stronger tests mentioned above becomes co-WT-hard 

Finally, we examine the complexity of finding a complement which renders a given 
insertion translatable. We show that this problem is polynomial in the number of tuples of the 
view, but inherently exponential in the size of the schema (and the logarithm of the number of 
tuples of the view). Similar results can be obtained for the two stronger tests. 

In Section 4 we extend these results to the case of deletions and replacements of tuples. We 
find that, for the most part, the extension is rather straightforward. Finally, in Section 5 we 
define and examine a new kind of functional dependencies which is important in the context of 
complements, the explicit junctional dependencies. We extend our characterization of 
complementary projections to also allow for the presence of explicit functional dependencies. 
Section 6 concludes this work by pointing out some directions for further research. 
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2. Defining a Complement 

Let S be a database schema (U, 2), where U is a universal set of attributes and 2 is a finite 
set of dependencies. A relation R over U (instance pf V) is called legal if it satisfies all the 
dependencies in 2 (notation: AN 2). A view of S is for us a projection defined by a subset J of 
U. For each instance R, the corresponding instance of the view is vj((R). We disambiguate 
updates on a view by defining a «?conrf v/ew, Y, the complement of X Two views * and K are 
called complementary if 7rj^i?;= v^R') and w yW= w y(R') imply tf = #', whenever R and tf ' 
are both legal instances. In other words, the two views together contain enough information to 
reconstruct the whole database. 

When are two views X and Y complementary? Clearly, a sufficient condition is that the 
MVD *[X, Y] holds in every legal instance, i.e. 2 implies the MVD *[X, Y). If this is the case, 
the database can be reconstructed from its projections on X and Y by join. Recently it has been 
shown [VI] that the condition is not necessary, i.e. if 2 consists of general first-order sentences 
then m % and v y can t> e complementary without the reconstruction operator being the join. 
However, we show that this cannot happen if we impose more restrictions on 2: 

Theorem 1: 

Let 2 consist of functional dependencies and join dependencies. Then X, Y are 
complementary iff 2N*[^T, Y\. 

Proof: The "if direction is immediate: if 2 implies the MVD *[X, Y\, then for every legal 
instance R we have irrfR)** y(R)=R. Consequently, if for two legal instances J?, R' we have 

*lW-»iflO and vy(R)=iry( R ')> we 8* **(*}** }(*)=* xt^'lf*') and &° m mis 
R=R\ i.e. X, Y are complementary. 

For the "only if direction, assume that 2 does not imply the MVD *[X, Y\; we will show 
that X, Y are not complementary, by exhibiting two distinct legal instances R, R' for which 
*x(R)=irx(R') and my(R)-vy(R'). 
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Let (r be a join dependency *[/?/, ... , Rf, define M(a) to be the set of MVD's 

{ *[ U /€S Rj , \J£ S R$, S } , S 2 a partition of {1 q} } (see also [MSY]). If 2' is the set 

we obtain if we replace each join dependency a in 2 by the multivalued dependencies in M(a), 
then, since a implies each MVD in M(a), 2 implies 2'; but by our hypothesis 2 does not imply 
*[X, Y], so 2' does not imply *[X, Y] either. Now since 2' consists of FD's and MVD's only, 
there is a two-tuple counterexample to this implication [SDSF], i.e. there is a relation R 
consisting of two tuples /i and v which satisfies all the dependencies in 2' but does not satisfy 

*[X, n 

From the relation R construct another relation R' as follows: since R does not satisfy 
*[X, Y\, it must be that ^xnY\=p[XT\Yi and also n[Y-X\*,[Y-X\ and p.{X-Y\*p[X-Y\. Let R' 
consist of a tuple p' which agrees with ft on X and with v on Y-X, and of a tuple v which 
agrees with v on X and with ft on Y-X. Clearly, #*#, /?' satisfies all the dependencies in 2' (it 
defines the same "special truth assignment' {SDSF] as R), and also m^R^ii^R') and 
vy(R)!=vy(R'). Thus, we only need to show that R and R' are both legal, i.e. they both satisfy 
all the JZ>'s in 2 (they obviously satisfy the FD"s in 2, since these are included in 2' and R, R' 
satisfy 2*). 

Let *[R ]t ... , R J be a /Z) in 2; to show that it holds in R, it suffices to show that, if a 

tuple £ is obtained by joining tfiRj], ... , tjRJ, where £ ; ^ are tuples of R, then either 

£= ft or |=f. This is certainly true if i 1 =...=i q =fi or if ti=...=Z q =p; else, let 
S ; ={/:$.= M }, S 2 ={/: £,•=*}. Since the MVD *[Vj€Sj R i> U i€S 2 R $ is in2 '« itholds 
in R, and thus either £=/* or £=i». Thus /? satisfies all dependencies in 2, and so does fl' (by 
the same argument). This completes the proof. ■ 

Notice that our condition (though not the proof) parallels the result of Rissanen on 
independence [Rl]. Intuitively, independence is stronger than complementarity, and thus our 
Theorem contains only the first condition of [Rl]. To see why, consider the classical Employee- 
Department-Manager scheme. The decomposition into X=ED, Y=EM is not independent, 
although X and Y are complementary. 
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Theorem 1 has some algorithmic consequences: 

Corollary 1: 

Given ((/, 2), X, YCU, whether X, Y are complementary can be tested in polynomial time. 

Prrof: By Theorem 1, testing for complementarity amounts to inferring an MVD from a set of 
FD's and JD\ The latter can be done in polynomial time [MSY, V2]. ■ 

Corollary 2: 

Given (U, 2) and XCU, we can find in polynomial time a minimal (nonredundant) 

complement of X. 

Proof: Simply start with the trivial complement U and repeatedly take out any attribute in X 
which can be taken out without affecting complementarity (examine the attributes in some 
arbitrary order). I 

Thus we can program in a database system some guidance to the user towards the 
definition of a complement Unfortunately, as so often happens, finding the minimum is much 
harder. 

Theorem 2: 

Given ((/, 2), XCU and fcK), determining whether there is a complement Y of X with 
\Y\-k is MP-complete. 

Proof: Membership in AT* is obvious: just guess a subset Y of U with \Y\=k and verify 
(Corollary 1) that X, Y are complementary. 

To prove the hardness part, we will make a reduction from the 3-satisfiability problem 
(3-SAT), which is known to be A7»-complete [Ck, K, GJJ. Let 9 be a Boolean formula in 3- 
conjunctive normal form (3-CNF); let x t /=l,...,n, be the variables occuring in qp, and let fj, 
;'=l,...,in be die clauses of <p. We construct the following schema S={U, 2): U is 
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F 1 ...F m X 1 X' I ...X n X' n A and 2 contains the functional dependencies F J ...F m X i -+X' i , 
F l ...F m X' i -*X i , i=l n, and also for each clause fj=ljj+lj2+ l j3> ;- 1 .-> WI > *e functional 

dependencies Lj^Fj. L j2~* F f L j^ F J ^ l Ji =X r L ;V =jr r if / ;Y =-IJC r L ji =X< ^ 

Now let X be Fj...F n JCjX' j...X n X w ; we claim that X has a complement Y with |7| = l+n 
iff <p is satisfiable. To see this, first assume that <p is satisfiable, and let h be a satisfying 
assignment. Take Y to be L^.L^A, where L i =X i if /^ is true, L ( =X ',- if tfxjj is false. To 
show that X F are complementary, it suffices to show (by Theorem 1) that Df=*[X, Y]\ to do 
that, we use the chase method for inferring dependencies [MMS]: if we consider the tableau 
consisting of a row with distinguished variables in the X columns and a row with distinguished 
variables in the Y columns, then we can convert the second row into a row of distinguished 
variables by using the FZ)-rules corresponding to the FZ)'s in 2 as follows: first, since h satisfies 

A at least one of the FD's {Lji~+Fj, L j2~* F f L j3~* F ? can ** used to fil1 in F f and *** can 
be done for all ;. Then the FD's F lm F m X { -*X' t F / ~.F ni r ,-**,- can be used to fill in the 

remaining Jiff's and X'fs. 

For the converse, suppose there is a complement Y of X with |y]=l+n. Clearly Y has to 
contain at least one of {X t X"$ (else there is no way to fill in both X i and X-), and thus Y 
contains exactly one of {Xj, X $ for each i (also A£Y). Consider now the assignment h, where 
h(x$ is true if Xfc Y and false if Xfc Y: since Fj is filled in, at least one of {L j]t Lp L j3 } must 
be contained in Y, and thus h satisfies fj. This is true for all j, so h satisfies <p and the claim is 
established. 

Finally, it is easy to see that 5_, and X can be constructed in time polynomial in the length 
of <p. This completes the proof. I 

Observe that in our reduction we only used FD's, so Theorem 2 is true even if 2 is 
constrained to contain only FD's. Now if 2'={Z-*-*fl | Z->B is an FD in 2}, then if a is a 
JD 2Na iff 2'Na [BV]. Thus, we might as well replace 2 by 2' in our proof, which means 
that Theorem 2 is true even if 2 is constrained to consist of MVD's only. 
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3. The Translation of Insertions 



3.1. Testing Translatability 



2 is now a set of functional dependencies: we furthermore assume that each FD in 2 is of 
the form X-+A, where A is a single attribute (this is easy to enforce, by replacing each FD 
X-*Y in 2 by the equivalent set of FD's {X-*A: A£Y}). 

Suppose that the view X and its complement Y are given, and so is the current instance V 
of the view. We wish to translate the update u on the view consisting of the insertion of a tuple 
t, while keeping the complement iiy(R) constant How can we design an update on R, T w 
which achieves this? 

The translation T u should have certain obvious properties: 

A. It should implement the view update, that is »j^T u {/fy= VUt. 

B. It should keep the complement constant, according to the prescribed semantics; that is, 
myCT^R^ityiR). 

C. It should yield a consistent database, that is, if R is a "possible" instance, 7y/?]N2. The 
meaning of "possible" is the subject of property D below. 

D. A more subtle but important assumption is that the user proposes the update based on his 
knowledge of the view and on no other information concerning the database. Thus, the 
translation should produce a legal database for all legal instances of the overall database, given 
the instance of the view. 

It is quite interesting that these properties determine precisely when the insertion of a tuple 
t in an instance V of the view is translatable, and, if it is, the translation T u is unique. 

First, suppose that tt V (otherwise T u is the identity). Since *y(R) must be kept constant 
(Property B) we must assume that iiXnY]£v xnY (R)=vxr\Y( v ^ otherf/is&, the only way to 
insert tin vy(R) (Property A) would be to insert something in <* y(R). By Theorem 1, ATI Y is a 
superkey of either X or Y. If it is a superkey of X, then the update is clearly untranslatable, 
because VUt is not the projection of a legal instance (Property C). So XT\ Y-* Y. It follows that 
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the only T u satisfying A, B and C is the insertion of the tuple t*n y(R) in the database R: 
T^R^R U t*wy(R) (* denotes the natural join). 

It remains to determine under which conditions TJ.R] is legal (Property C). The insertion of 
t into V is translatable iff TJ_R]\=I. for all R such that AN 2, irx(R)= V (Property D was used 
here). 

Suppose that the insertion is not translatable. This means that there is a functional 
dependency, say Z-*A, which is violated by TJR] for some R for which rtl=2 and v%(R)= V. 
Since R satisfies Z-*A, the inserted tuple must be the culprit Thus, there must be a tuple r of 
V which agrees with / on ZHX and, if A€X, disagrees with t on A. Furthermore, if we fill the 
rows of V with new symbols in the columns of Y- X, only with i[Zn(Y-X)]=pi[Zr\(Y-X)] where 
/1 is a tuple agreeing with t on XDY (call this relation R(V, t, r, Z-+AJ) and then perform the 
chase [MMS] wrt 2 on this relation, no two distinct elements of V, neither the elements 
corresponding to t[A], fi[A] (if A£Y-X), are ever equated (if they are, we say the chase 
succeeded). It turns out that this is a necessary and sufficient condition for untranslatability: 

Theorem 3: 

The insertion of / into V (tiV) is translatable as R+R U t*iry(R) if and only if 

(a) WnY\£n xny (V). 

(b) 2 implies XDY-*Y, and 2 does not imply Xf\Y-*X. 

(c) Chase^lflfK u n J)\ succeeds for all functional dependencies y€2 and tuples r of V. 

Proof: By the preceding discussion, all we need to notice is that, if Chase 2 [/?f K, U r, j)\ does not 
succeed for some FD /€2 and some tuple r of V, then it actually provides us with a 
counterexample, i.e. it constructs a relation R such that /fN=2, ir^/?>= V, and T^R] violates/ 
In the opposite case, the chase actually provides us with a proof that there can be no relation R 
such that *N2, vx(R)= V, and T^R] violates some j€2, i.e. 7y/?]l=2 for all R such that 
M=2, my(R)=V. I 
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Corollary: 

Whether an insertion is translatable can be tested in time 0(| *1 5 /og| K| |2| 2 \Y-X\). 

Proof: Clearly, condition (a) can be tested in time 0(\V\), and condition (b) can be tested in 
time 0(\2\) (using the linear-time algorithm [BB] for inferring an FD from a set of FD's). Since 
condition (c) can be tested by doing 0fl2| \V\) chases, it suffices to show that the 
chase of R(V, t, r, J) can be computed in time 0(\V\ 2 log\V\ |2| \Y-X\). Recall that the chase 
procedure consists in repeatedly locating a pair of tuples /i, v such that /i[Z] = i'[Z] and 
/aM]*!-!/!] for some FD Z-+A in 2, and replacing the element p[A] with v[A] throughout the A 
column. This can be done by the following straightforward algorithm: 

Initialize R* to be R(V, t, r, f). 
Repeat until no new change is made on R : 
For each FD Z-*A in 2 do: 

Sort R* lexicographically according to the elements of the Z columns. 

Find the first pair of consecutive tuples /*, v such that n[Z]=p[Z], n[A]*p[A]. 

Replace p.[A] by v[A] throughout the A column. 

It is clear that each execution of the body of the for loop takes time 0([V\log\V\), so each 
execution of the for loop takes time 0(\V\lo&V\ |2|>. Since each time the for loop is executed 
the number of distinct symbols in the Y-X columns is reduced by at least one (if the chase ever 
attempts to equate two different elements in one of the X columns we stop immediately), and 
initially we have \Y-X\\V\ such symbols, the for loop will be executed at most \Y-X\\V\ times, and 
so the total time is at most 0(\V\ 2 log\V\ |2| \Y-X\). I 

The algorithm described above can be speeded up by taking the following straightforward 
shortcut: to construct R(V, t, r. Z-*A), first fill the rows of V with new symbols in the columns 
of Y-X, then do a chase (and store the resulting relation to be re-used for other members of 2), 
and then set t[Zn(Y-X))=ii[Zn(Y-X)], for some ft agreeing with t on XT\Y. However, since we 
are still unable to provide a better quarantee for its worst-case performance than 0(\V\ 3 lo&V\), 
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its applicability in practice is dubious, in view of the fact that \V\ is normally very large. For this 
reason, we will now present two alternative tests for which we can show better upper bounds to 
their worst-case performance. However, our tests will be stronger than necessary, i.e. in addition 
to rejecting all untranslatable insertions, they may also reject some translatable ones. 

Test 1 

Our first alternative test consists in simply avoiding to do a full chase on R(V, t, r, Z-*A); 
instead, for each tuple /i agreeing with / on XC\Y, we do a chase on the two-tuple relation 
consisting of r and ji, and we report success if any of these chases equates t[A], n[A) (if A£ Y-X; 
notice that in this case n[A)=1[Al since XHY-+Y), or attempts to equate two distinct elements 
of V. Thus, what we are actually doing is imposing the extra requirement that 
Chase 2 [/J(T, /, r, f)\ succeeds fast, if it succeeds at all. Intuitively, this does not seem to be very 
restrictive, and one may hope that Test 1 will actually accept most of the translatable insertions 
that will occur in practice. 

The test can obviously be implemented in time 0(\V\ 2 \2\). However, we can do better (in 
terms of the dependence of the time complexity on \V\\ as follows: 

1. Fill the rows of V with new symbols in the Y-X columns. Then determine the set of tuples 
T={n: n[Xr\Y]=txr\Y]}. This can be done in time 0(\V\). 

2. For each ZCU, construct a copy of the relation T (call it T z ), and sort it according to the 
contents of the Z columns. This can be done in time Ofd™ \V\log\V\). 

3. For each ZCU, compute the closure of Z under 2, i.e. the set Z + -{A: 2b=Z-*A}. This 
can be done in time 0(2^ \2\) (using the algorithm of [BB] for computing closures). 

4. For each ZCU, go through the table T% from top to bottom and, whenever a tuple agrees 
with the previous one on Z, make it agree on Z + . This has the effect of making all tuples 
which agree on Z to agree on Z + (as they should), and it can be done in time 00^ \V\). 
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5. For each FD Z^-A in 2 do: 

For each tuple r for which t[Zr\X\= iZC\X] and AA*AA (if &*), do: 
Make r agree with /n on ZH(Y-X), where /x is a tuple in T. 
For each ZCU do: 
Insert r in T%. 

If r[Z]=p[Zl where i» is either the tuple next to r or the tuple before r in 7^, 
then make r agree with v on Z + . 

This can be done in time 0(|2| \V\ 2^ log^). 

Thus, the overall time expended is 0(\V\log\V\ 2^ \Z\). Of course, there are various 
optimizations and shortcuts one may employ in an actual implementation (for example, to 
handle the potential problem of having too many sorted tables - say by actually having for each 
Z a sequence of pointers to the tuples of 7). Observe that the running time of this algorithm will 
be better than our worst-case upper bound for the exact translatability test (and also better than 
the obvious 0(\ V\ 2 \1\) algorithm) if \V\/log\V\ > 2^ which is definitely going to be the case in 
practical situations. 

Test 2 

Our second alternative test has a somewhat different flavor: notice that Test 1 saves time by 
doing only part of the computation necessary for each particular chase. Test 2, instead, will only 
do one full chase, if this is possible. 

More specifically, recall that the essential part of the translatability test (in terms of time 
requirements) is checking if for all R such that /tt=2, vy(R)= V, we have r,J/f]N=E. Suppose 
now that Y actually has the following property: 

For all Rj, R 2 such that R^l, Rjt=l, v^R^v^R^, 

4jmy]€ffj n /*/>=* jm^y- we have that Wft* 1 iff fjtjh^ 

We call such a Y a good complement of X. Our interest in good complements lies in the 



22 



fact that, if Y happens to be a good complement of X, then clearly all we need to do to test if 
the insertion u is translatable is consruct some relation Rq such that fy*^, v^(Rq)= V, and 
test if rj/yNS. We can construct such an R by filling the rows of V with new symbols in 
the Y-X columns and then doing a chase; this can be done in time 0(\V\ 2 !o^V\ |2| \Y-X\). 
Testing if T^R^Rq U fir y(Rq) satisfies 2 amounts to testing if for each tuple /i of Rq, the 
two-tuple relation consisting of /i and (*iry( R o) satisfies all the FD's in 2; this can be done in 
time 0(\V\ \2\). 

Thus, all we need to do is show how one can test if a given complement Y of X is actually 
a good complement. Observe that this property is independent of the tuple / to be inserted, i.e. 
it is a property of the schema only (X, Y and 2). 

Suppose, then, that Y is not a good complement of X. This means that there are two 
relations R } , R 2 such that /J 7 »=2, /? 2 *=2, m^R^ vrfRJ, 4^fl€»*nKV =,r Jrni'W' 
rj/J 2 ]N2 and T^Rj\=R l U fir^Rj) violates some FD in 2, say Z-+A. Since Rjb=2, 
there must be a tuple nj in Rj such that ft i[Z\=v ^1*11^1)}, M/M^wJ'*' /*/>]• Also 
there must be a tuple vj in /?/ such that v j[XC\Y]= i[Xr\Y\. 

Since v^R^v^R^, we can then find two tuples fi 2 , v 2 of # 2 such that /i^M^pf], 
p^^rylX]. Now consider the relations R'j, R' 2 , consisting of /i/, pj and of p^ "2 
respectively. Clearly, /?';N2, /J'jjN^ (since 2 only containd FD's), irrfR' /^wj^V' 

<pfny]€ w;m K*V= *Ani<*V' V^ 2 and W/l violates z "^ ,/t 

Thus, y is not a good complement of X iff there are two relations R' j, R' 2 with at most 
two tuples each which witness this fact 

From the above observation, we can easily see that we can test if Y is a good complement 
of X by doing the following for each FD Z-*A in 2: 
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Initialize T.j to be a relation with three tuples ji/, pj, tj as follows: 

t][W\=a for each W in U, 

vj[W\=a for ^ in Y, a } for W in X-Y, 

lij[W\=a for JP in Z, a 2 for W in U-Z. 
Initialize T 2 to be a relation with three tuples ji^ "2- '2 ^ foUows: 

t 2~ t l' P 2 =P I' f l ji^ = a 2 for each W in U. 
Repeat until no new change is made on either Tj or Tf 

Compute the chase wrt 2 of pj, v l (in this and all subsequent chases, to equate 
a t and a* i < ;, replace ctj by aj). 

Compute the chase wrt 2 of /i^ v% of v^ 1} and of /t^, 1% 
N [X\+iijLX], rjlXl+rjiX]. 

When the above procedure terminates, we check if nj[A]=tj[A]. If not, then Tj, T 2 
constitute a counterexample to the goodness of Y; if it turns out that njlA]=tj[A\ for each FD 
Z-*A in 2, then we have actually proved that no such counterexample with at most two tuples 
in each relation can exist, and so Y is a good complement of X. 

Since each execution of the repeat loop can be done in time 0(\2\), and each time we lose 
at least one out of 0(\U\) symbols, the running time of the algorithm is 0(\2,\' \U\) (the 
procedure is repeated at most |2| times). 

Notice that, if Y happens to be a good complement of X, then actually Test 2 accepts 
precisely the translatable insertions, whereas in the opposite case it rejects all insertions. 
However, testing if Y is a good complement of X can be done once and for all at the time Y is 
declared as the complement to use, and if it is found to fail then the database system can simply 
disregard Test 2. 
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3.2. Complexity of Testing Translatability 

So far, we have shown how one can test if a proposed insertion of a tuple into a view is 
translatable, and if so, how to do the translation (Theorem 3). We presented an 0(\ V\ hg\V\) 
algorithm for testing translatability. Since this algorithm is likely to be inefficient in practice, we 
also developed two alternative stronger tests, which can be executed faster. 

In the sequel, we are going to prove a result which has some negative implications 
regarding the extent to which one can hope to improve the running time required to test 
translatability. Specifically, we will show that, if the view is presented in an exponentially 
succinct way (i.e. as a union of Cartesian products) then testing translatability becomes U/-hard 
[St]. This result provides strong evidence against the possibility of having an algorithm that runs 
in time less than 0(\ V\), i.e. it indicates that the whole view has to be examined in order to test 
translatability. 

Moreover, we believe that this result also casts some doubt on the possibility of 
substantially improving the running time of our algorithm. Loosely speaking, II /-hardness 
seems to indicate that the problem lacks a "nice" combinatorial structure, which could be 
exploited to yield an algorithm considerably more efficient than the one resulting from our 
(more or less) straightforward approach. 

We will now prove the result: 

Theorem 4: 

Determining if an insertion is translatable is II /-hard if the view V is given implicitly as 
the union of two Cartesian products, of total size 0(\ U\). 

Proof: Let G be a Boolean formula in 3-CNF, containing the variables Xf, /=1,...,«, and 
consisting of clauses fj, j=l,...,m, and let X={xj, ... , x k }, K= {**+/. ••• . *„) be a given 
partition of the set of variables of G. It is known [St, W] mat determining if for all possible 
assignments of truth values to the variables in X G is satisfiable, i.e. if V X3 Y G(X. Y)= 1 (where 
Vr means Vjc/.-Vx^ etc.) is Il/-complete. In what follows we give a polynomial-time 
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reduction from this problem to the problem of testing translatability of an insertion to a 
succinctly presented view. 

Let U be BX t X } ...X n X fl AF ] ...F m C, and let 2 consist of the FD's X } X )...X $ k ~* A, 
Fj...F m --C, BA->C, and, for each clause fj=ljj + Ij2 +1 j3 of G < me FZ) ' S L jl A ~* F f L j2 A ~* F f 
L j3 A-+Fj (where L jt is X r if l fi is x r and L jt is X \ if I j( is ->x r ). Let the view be 
BX 1 X' } ...X n X n . and let the complementary view be XjX 1 ...X n X , n AF J ...F m C. Finally, let the 
instance V of the view be s B Xs x %• x ~ Xs XJf U s ' where S XJC- is a relation over x fii 
consisting of two tuples fij, v { with m^=0, [1/^=1. *m= l > »W=°. *5 is a sin 8 le ^P^ 
over 5 with s^[5]= 6, and s is a single tuple over BX 1 X ] ...X n X n with s[i?]=a, fX$=h 
iX ]=1. Observe that K is essentially just a list of all possible truth assignments: each tuple ft 

of V, with the exception of s, defines an assignment h: {x } Jc„}-*{0, 1} by taking 

hC Xi )=n[X}L (/*[*> -»^); also, n[B]=b. 

Suppose now we want to insert in V the tuple /, where i[B\=b, 
iX 1 X 1 ...X r pC^=AX]X' l ...X n X'^. We will show that this insertion is translatable iff 
VX3Y G(X, Y)=\. First, it is obvious that conditions (a) and (b) of Theorem 3 are satisfied. 
Furthermore, observing that the only tuple agreeing with / on X I X t ] ...X n X' n (the common part 
of the view and the complement) is s, it is easy to see that condition (c) is satisfied if the FD /is 
taken to be XjX 1 j...Xjpt k -+A (because the only tuple agreeing with / on XjX ]...X k X'j c is s), or 
if /is F]...F m ->C (since no attribute of /is in the view), or if /is Lj t A-*Fj (since s agrees with / 
on all possible L./s). 

Thus, all we have to show is that, for all tuples r with t*s (these are the tuples agreeing 
with / on B), Chase 2 [fl<r, U r, BA-*C)} succeeds (i.e. starting with AA]=iA] we eventually 
equate t[Q, $Q) iff there is a satisfying assignment h for G which agrees with the one defined 
by r on {*/ x$. 

First, suppose there is such an assignment h, and let r h be the tuple corresponding to it. 
Since r^X 1 X 1 ...X k X k ]=iX I X 1 ...X k X' k l r&A]=A*l so r£A\=iA\. Since h satisfies f f 

r£Lj$=l for some i, so rfrLj^ALfi, i.e. r^Lj^ iL^A], and so r£F}=iFj. for;=l nu 

Thus, r f lF 1 ...F n j=iF 1 ...FJ, so r^Q^iQ. But since r£BA]=i[BAl r$C\=AC\> and thus 
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r[Q = iQ, i.e. Chase 2 [rtfK t, r, BA^Q] succeeds. Conversely, it is not difficult to see (by 
essentially tracing the previous argument backwards) that r[Q, iQ can only be equated if there 
is a tuple corresponding to a satisfying assignment and agreeing with r on XjX j-Xj^C^ 
Thus, we have established that the insertion of t into V is translatable iff 
VA17 G(X, Y)=l. Since U, 2, / and the description of V as a Cartesian product can obviously 
be constructed from G, X, Y in polynomial time, we are done. I 

It certainly is not surprising that using a similar (only simpler) construction we can show an 
analogous result for Test 1: 

Theorem 5: 

Determining if Test 1 accepts an insertion is co-AW-complete if the view V is given 
implicidy as the union of two Cartesian products, of total size 0(\ U\). 

Proof: We first show membership in co-NP: if X denotes the view and Y the complement, then 
the following is a non-deterministic polynomial time algorithm to determine if Test 1 does not 
accept the insertion of the tuple / into V: guess an FD Z~*A in 2 and two tuples r, /i over X, 
and verify that r€V, M €K, iZnX\=AZnX\ (and AA]*^A], if A£X), n[XnY\= WHY}; 
construct a relation R consisting of two tuples r, /*', with r'[X]=r, /*'[*] =fi, and with new 
symbols in the Y-X columns, only with r'[Zr\(Y-X)]= p'[Zn(Y-X)]; compute the chase wrt 2 of 
R, and verify that it does not attempt to equate two distinct elements of r, p, and, if A€ Y-X, it 
does not equate t[A], p\A\. 

To prove the hardness part, we will make a reduction from unsatisfiability of Boolean 

formulas in 3-CNF. Let G be such a formula, with variables Xj, i'=l n, and clauses fj, 

;=l,...,m. Let U be BXjX y.-Jf^C, and let 2 consist of the FD's B-*C, and, for each clause 
f.= l. I+ lj 2 +lj 3 of G, the FD L^L^C, let the view be BXjX j..*^ „, the complement 
be XjX^.XfjXff, and the instance V of the view be s B Xs x tf , x - Xs X n X' U * where 5 # 

S XX are ** in * e Pro0 ^ of T^ 01 " 6 " 1 4 ' ^ $% =a > 4^=°. £X"jl=0. 

Suppose now we want to insert in V the tuple t, where iB)=b, 
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W 1 X' 1 ...X n X' n ] = iX 1 r 1 ...X n X' n ]; we claim that this insertion is accepted by Test 1 iff G is 
unsatisfiable. To see this, observe that the only tuple of V agreeing with / on XjX ^...XJC n (the 
common part of the view and the complement) is s; by a reasoning similar to that in the Proof 
of Theorem 4, we see that the only FD which needs to be checked is B-+C. Now if r*s, the 
chase on r, s will equate r[C\, iQ iff for some ;, A^ji L j2 L j^^ L jl L j2 L j^ ie - 't L //] =0 ' whicn 
means that the assignment corresponding to r does not satisfy fj. Since this should happen for all 
tuples r, the claim is established. 

Since U, 2, / and the description of V as a Cartesian product can be constructed from G in 
polynomial time, the proof is complete. I 
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3.3. Finding a Complement 

So far, we have assumed the following scenario for translating view updates: when the user 
updates a view, he also specifies unambiguously the semantics of the update by defining a 
complement which should be kept constant during the translation. We studied in detail the 
problem of checking if a proposed insertion of a tuple into a projective view is translatable, 
when the complement is another projection and the database consists of a single relation 
satisfying a given set of functional dependencies. 

However, a real database system should also be able to provide the user with some 
assistance concerning the task of defining a complement. We already gave a glimpse at this 
problem in Section 2, where, after we characterized complementary views, we examined the 
problems of finding a nonredundant complement and a minimum complement. Now that we 
have also gained some understanding of testing translatability, we can pose the following 
question: Suppose the user wishes to have the update translatable, imposing only partial (or 
none at all) restriction on the complement to be used. How can one determine a complement 
which will render the update translatable? 

Let the view be X, and suppose Y is a complement of X such that the insertion of the tuple 
/ into the instance V of the view is translatable under constant Y. Clearly, Y= WU(U-X), where 
WCX. Since W^v^fV) (condition (a) of Theorem 3), there is a tuple r of V such that 
/fW1= W Consider now the set of attributes W r ={A: A€X, t[A]=^A]}. It is immediate that 
iWJ£m w (V), Zt=W r -*Y (since 2.¥=W-*Y by condition (b) of Theorem 3, and W^DW), and 
2 does not imply W r -*X (if ~Z¥=W r -*X, then the insertion of / into V is not translatable since 
iW^=^W^, t*r); moreover, if R is a database such that tfNZ, v x (R)=V, then 
fvw(R)=1*itw(Rl and thus since R U t*v w (R)b='2, it follows that also R U f*w w (Rp=?., 
for all such R. Therefore, the insertion of f into V is also translatable under constant 
Y r = WJJ(U-X). 

From the above discussion, it is easy to see the following: 
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Theorem 6: 

Given 2, X, V and /, we can find a complement Y of X such that the insertion of t into V 
is translatable under constant Y within min(\V\, lW) tests of translatability. 

Proof: One can compute, for each tuple r of V, the set W r ={A: A£X, r[A]=^A]}, and, after 
eliminating duplications, test, for each such W r if the insertion of t into V is translatable under 
constant Y r = W r U{U-X). If no such W r is found, then, by the preceding discussion, there is no 
complement Y of X such that the insertion of t into V is translatable under constant Y. I 

Thus, we can determine if there is a complement which renders a given insertion 
translatable in polynomial time (see the Corollary to Theorem 3). Observe, however, that the 
polynomial complexity depends strongly on the fact that we are allowing the whole view V as 
part of the problem instance. The following result indicates that there is an inherent exponential 
dependence on \U]+lo$V\; in other words, we may nevertheless have to check all possible 
subsets of X in order to find a complement. 

Theorem 7: 

Determining if there is a complement Y of X such that the insertion of / into V is 
translatable under constant Y is iVP-hard if the view V is presented succinctly (as in Theorem 4). 

Proof: Let G be a Boolean formula in 3-CNF, containing the variables x t /=l,...,n, and 
consisting of clauses L j= l,...,m; assume furthermore with no loss of generality that the 
variables appearing in each clause are distinct. Let U be X 1 X J ...X r ^' n Fj.,.F m , and, for each 

clause fj=lji+lj2+lj3 of G > let 2 c 0012 " 1 me m * L j^ F f L J2~* F t L P~* F f Ut ** viCW X 
be J / r / ..Jr n r /p and the instance V of the view be s% X .X- X *jr Y » wnere s XpC i fe M "* 
the Proof of Theorem 4. 

Suppose now we want to insert in V the tuple /, where iX^~iX^=l. We will show that 
there is a complement Y~ WUFj...F m (WCX) such that the insertion of t into V is translatable 
under constant Y, iff G is satisfiable. First, it is easy to see that, since i W\ should be mity^V), 
W should contain at most one of the attributes Xj, X \ for each /; furthermore, since we should 
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have that 2l=W-»F 7 ...F m (clearly 2 does not imply W-*X), it is not difficult to see that W 
should define a satisfying assignment for G. 

To complete the proof, observe that, if J? is a database such that #1=2, irx(R)= V, then for 
any two tuples /i, v of R, p[FJ=p[Fj. This happens because, if p[£//]* "[£//] and 
v\L,d*9[LA (in the opposite case obviously p[Fjl= r[Ffi, then there is a tuple {in R such 
that i[Ln\=f\Ltji. ^[Lj 2 ]=v[Lj 2 ] (because the variables in fj are distinct); thus, £[Fjl= fi[FJi, 
£[Fji=i>[Fl and therefore n[Fj=p[Fl It follows that, for the insertion of / into V to be 
translatable under constant Y, it suffices to have W^vy/V) and 2t= W-*Fj...F m . 

Finally, U, 2, X, t and the description of V as a Cartesian product can be constructed from 
G in polynomial time, and we are done. I 

We remark that, by following a similar line of reasoning, one can see that Theorem 6 
remains true if we interpret "translatable" as "accepted by Test 1 (Test 2)", and "test of 
translatability" as "Test 1 (Test 2)". The same holds for Theorem 7 (the reductions, although 
somewhat subtier, are based on the same idea as the reduction given in the Proof of Theorem 7, 
and are therefore omitted). 
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4. The Translation of Deletions and Replacements 

In this Section we briefly show how the ideas developed previously for the case of 
translating the insertion of a tuple to a view can be adapted in a straightforward manner to 
handle the case of deleting a tuple and of replacing a tuple with another. We continue to assume 
that 2 is a set of FZ)'s satisfied by the database R, and that we are given the view X, the 
complement Y and the current instance V of the view. 

4.1. Deletions 

Suppose we wish to translate the update u on the view consisting of the deletion of a tuple 
/, /€ V, while keeping the complement ir y(R) constant. The update T u on R which achieves this 
should satisfy v^T^R^V-t, vy(TJ i R])=iry(R), and also TJ/?]N2 for all R such that 
/?N2, wx(R)= V (compare with Properties A through D given for the case of an insertion). 

Now since w y(R) must be kept constant, we must have that t[XT\ Y\£vxf) y(T-# in other 
words, there is a tuple r€V such that rtt, r[XC\Y)= WHY]. From this we now see that XHY 
cannot be a superkey of X (since V is a projection of a legal instance), so by Theorem 1, 
XC\ Y-* Y. It follows that the only possible candidate for T u is the deletion of the tuple (*ny(R) 
from the database R: T£R\=R-fiiy(R). 

But now observe that, since fJ/JJC/f and 2 only containds FD\ T' U [/?]N2 if KN2. Thus, 
our last requirement that 7„[/?]N2 for all R such that tf*=2, m^R)= V, is satisfied trivially. 

We have thus shown the following: 
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Theorem 8: 

The deletion of / from V is translatable as R*- R-l*m y(R) if and only if 

(a) txnY]£ir xny (V-t). 

(b) 2 implies XC[Y-*Y, and 2 does not imply XHY^X. I 

Hence, determining if a deletion is translatable can be done in time 0(1H+|2l>. 

4.2. Replacements 

Suppose now the update we wish to translate under constant complement Y is the 
replacement of a tuple tj, //€K, by a tuple tj. t 2 €V. The update T u on R should satisfy 
v x (TJ i R])= V-tj U t} and again w y (T I J/?y=w^>, and r,J/?]N2 for all R such that KN2, 
irx(R)=V. We distinguish two cases: 

case i //[jsrnjr^jrnyi. 

This case exhibits a behavior similar to the one we are already familiar with: specifically, 
since iry(R) must be kept constant, we must have //[A r ny]€wjp,y( r K-^;, ljXC\ Y\€"b X C\ y(V). 
From this it follows that ATI Y cannot be a superkey of X, and thus it is a superkey of Y by 
Theorem 1. Hence, the only possible candidate for T u is the replacement of the tuple tfir y(R) 
by the tuple tfrty(R). 

To check now if the last condition is satisfied, i.e. if FJ/JJN2 for all R such that AN 2, 
•nx(R)= V, it is not difficult to see (by a reasoning exactly analogous to the one given for 
insertion) that all we have to do is check if Chase 2 [J?(T, i^ r, J)] succeeds for all FD's /in 2 
and for all tuples r in V which are different from tj. 

Case 2 tj[XnY\=tJLXnn 

In this case we see that the first two conditions can be satisfied with no further restrictions 
on V, t, X, or Y, and moreover the only possible candidate for T u is replacing the set of tuples 
tj*fty(R) by the set of tuples 'tfwy(R) ( we can D0 longer assert ^ before ^ either set wU1 
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consist of a single tuple, since this depended on X(l Y being a superkey of Y, which is no longer 
necessary). 

Checking whether the last condition is satisfied, i.e. whether rjflJNS for all R such that 
/?N2, iix(R)= V, can still be done by checking if Chase 2 (/?<T, i t r, J)] succeeds for all /in 2 
and for all r in V, r*t 1 (one can see that the fact that tfwyfR) and t 2 *iry(R) may consist of 
more than one tuple does not affect anything). 

Thus, we have the following: 

Theorem 9: 

The replacement of /, by t 2 in V (//€K t 2 lV) is translatable 
as R+R-tf-nyfR) U t 2 *vy(R) if and only if 

(a) tfiXnY\£m xr[ y(V-ti) and t£XT\ Y]^XH Y< V 2>> or tj[XnY]=^^n 

(b) 2 implies XT\Y-*Y and 2 does not imply XHY—X, or tfiXnY\=t£Xr\.Yl 

(c) Chase^[R(V, tp r, f)\ succeeds for all / in 2 and for all r in V, r*tj. I 

From Theorem 9 it should be clear that one can develop results analogous to the ones 
given for the case of insertion in a straightforward way. Thus, we will not pursue this direction 
any further. 



34 



35 



5. Explicit Functional Dependencies 

Functional dependencies assert that a certain mapping is one-to-one -for example, a 
mapping from employee-project pairs to managers, or from cost-price pairs to rates of profit. 
However, there is a difference; certain such mappings are essential information stored by the 
database (as in the first example above), whereas others are redundant information, mappings 
that could be computed explicitly (as in the second example). We call the latter case of FU% 
explicit FZTs (EFD's). 

EFD's are important in the context of views and view complements, because they can 
seriously affect the information content of database mappings. We thus felt that we should study 
their behavior vis-a-vis the other known classes of dependencies. We first define formally what 
an EFD is: 

Definition: 

A set of attributes X explicitly determines a set of attributes Y (notation: X~* e Y) if there is 
an instance-independent function /(called a witness of X-* e Y) such fhat.*xy(R)=J(vx(R)), for 
any legal instance R of the database. 

Examples. Cost-Profitrate-+ e Price, Course-Student-Grade-» e Average-Grade. 

We remark that, in our definition of an EFD, no special property of the witness function/ 
is assumed. This leads naturally to the following extension of the meaning of implication of an 

EFD a from a set of dependencies 2, where a t /=1 *, are the EFD's in 2: for all functions 

f t i=l,...,k, there is a Junction f such that, if a database R satisfies all dependencies in 2 (where 
fj is taken as the witness of a t ), then it also satisfies a (where /is taken as the witness of a). 
In case a is not an EFD, then one just omits the requirement of the existence of / 

As we are going to see shortly, with this approach EFD's behave very much like FD's (in 
the sense of Propositions 1 and 2). It would be interesting to see what happens if one imposes 
natural restrictions on the witness function / such as invertibility, 0-1-valuedness, etc. 
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In the following, if 2 is a set of dependencies, we denote by Z F the set of 
FD's {X-+Y: X-+ e Y is in 2}. 

Proposition 1: 

Let 2 be a set of EFD's; then 2N*-* e Y iff 2 /i t=*-»T. 

Proof: Consider the following chase procedure for computing X + : initialize X + to X; 
repeatedly locate a member Z-*B of 2^ such that ZCX + and B is not contained in X + , and 
set X + to X + UB. As is well known [MMS], this procedure terminates with a unique X + , and 
furthermore 2 / ,*=J-r iff YCX + . 

We will now argue that also 2NX-> e Y iff FCX+. First, if YQX+, then it is clear that 
IN^g 7 by the construction of X + (observe that, if X-> e Y and Y-+ e Z, then X-* e Z). 
Conversely, if Y is not contained in X + , we will show that 2 does not imply X-* e Y. For each 
EFD Z-* e B in 2, pick as its witness a function fz-*B ^ ch that /z-»jB^Z^ =:r Z# where 'Z is 
a tuple over Z with t^W\-a for all JF, and < z # is a tuple over Zfl with lzB^ =a for ^ ^ 
Now if g is a purported witness of X-* e Y, then consider the database R consisting of a single 
tuple / with i{W\=a for W£X + , and ftW\=y otherwise, where y*v A (gftX])), for some A in 
y-*. It is clear that R satisfies each EFD Z-* e B in 2 (with witness /z_^), but R does not 
satisfy X-+ e Y with witness g. I 

Proposition 2: 

Let 2 be a set of £FZ)'s, and let 2' be a set of FZ)'s and 7/)'s. Suppose that 2U2W: 

(a) If a is an FZ) or JD or embedded JD, then 2j*J2W. 

(b) If a is an EFD, then 2*=a. 

Proof: 

(a) If 2j*J2' does not imply a, then there is a relation R which satisfies 2^U2' but violates 
a. Now since /N=2^, clearly we can pick a function f t for each EFD a, in 2 such that R also 
satisfies 07 with witness y}. Thus, R satisfies 2U2\ and therefore 2U2' does not imply a. 

(b) Assume that 2 does not imply or, and observe that the one-tuple relation R constructed in 
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the Proof of Proposition 1 also satisfies 2' (since it satisfies any FD, JD or embedded JD. Thus, 
R satisfies 2U2' and violates a, and so 2U2' does not imply a. ■ 

Thus, we can easily augment any of the known axiom systems for FD\ FD's and MVHs 
etc. to include EFD's. Moreover, our characterization of complementary views (Theorem 1) can 
be extended to include EFD's as follows: 

Theorem 10: 

Let 2 be a set of FD's, JD's and EFD's. Then X, Y are complementary iff: 

(a) They are complementary when considered as views of irjnjyW (i.e., 2 implies the 
embedded MVD XClY-*^X-Y\Y-X); and 

(b) 2NATjy-» e U. 

Proof: The "if direction is immediate: from (a) m ^Rfit yfR)=* 'XU^ for every legal 
database R, and then from (b) R =/w A1 j y(R))=f[ii rfR)* w y(R)), where /is an instance- 
independent function. Thus, if for two legal instances R, R' we have v^RJ^vj^R') and 

wy(R)=*y(R'), we get R=J^x( r ^y( R >=^x( r '^y( R ')= R '' Le - x - Y m 
complementary. 

For the "only if direction, assume first that (a) is false, i.e. 2 does not imply the 
embedded MVD Xf\Y-*-*X-Y\Y-X . We first remark that the Equivalence Theorem of [SDSF] 
is also true if a is an embedded MVD (using the partial extension of the equivalence between 
dependencies and formulas to include embedded MVDs described in Section 7; the 2-tuple 
Subrelation Lemma can be extended to the case in which a is an embedded MVD, by an 
argument analogous to the one given for the case in which a is an MVD). Using the same 
construction as in the Proof of Theorem 1 (combined with Proposition 2 (a) and the above 
observation), we obtain two distinct two-tuple relations R, R' such that wj^RJ^n^R'), 
•ny(R)=iiy(R'), and R, R' satisfy all the FD's and JD's in 2 and all the FD's in 2^. Then it is 
easy to see that we can pick, for each EFD a in 2, a function /such that bom R and R' satisfy 
a with witness / This shows that X, Y are not complementary. 
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If (b) is false, then (XV Y) + *U, where (XI) Y) + is the closure of XV Y wrt 2p Let R, R' 
be two one-tuple relations such that R[W\ = R'[W\=a for W in (XUY) + , and R[W\*R\W\ 
otherwise. Clearly, R*R\ ir x (R)=w x (R'), *y(R)=my(R'), R, R' satisfy all FD's and JD's in 2, 
and moreover by picking as the witness of an EFD Z-*B in 2 a function fz-*B as in the Proof 
of Proposition 1, we see that R, R' also satisfy the EFD's in 2. This shows that X, Y are not 
complementary, and the proof is complete. I 

Intuitively, Theorem 1 stated that, if the only dependencies present are Fffs and JD\ then 
the only way to reconstruct a database from two projections is by join. Theorem 6 states that, if 
EFD's are also present, then the only way is to join the two projections and then explicitly 
compute the information which is still missing. 
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6. Conclusions and Directions for Further Research 

In this work, we have studied some of the computational problems arising when one 
considers applying, in the context of the relational model, the methodology suggested by 
Bancilhon and Spyratos for translating view updates. We discovered that certain important 
problems such as testing translatability and determining a complement which renders an update 
translatable, although solvable in polynomial time (Theorems 3, 6, 8, 9), exhibit an interesting 
kind of inherent complexity (Theorems 4, 5, 7), which indicates the existence of limitations on 
how efficiently they can be solved. However, we have only concentrated on a very simple case of 
the application; we feel that much remains to be done before a reasonable account of the 
applicability of the methodology can be attempted. In particular, the following possibilities seem 
to us to be worthy of further investigation: 

(1) Allowing more general dependencies: In particular, it would be interesting to see to what 
extent can Theorem 1 be generalized, especially in view of the negative result of [VI]. More 
importantly, though, one should study the problem of testing translatability and designing a 
translation (recall that we found the translation of deletions to be trivial just because we only 
considered functional dependencies). It is conceivable that our basic idea of a chase-type 
algorithm will be useful, although we cannot see to what extent 

(2) Considering views that are a restriction of a projection (i.e. of the form <rp>*x> where P ® a 
predicate on tuples): It should be noted that most of the views occuring in practice are actually 
of the above form. The complement here can be a pair of views, e.g. (a-,/* opm y) or 
(o-, p ir x , »y), where ity is a complement of m x - We believe that, in the case of only 
functional dependencies (which is still very important from a practical viewpoint), our basic 
approach can be used with only simple modifications (at least for certain Pi). 

(3) Considering multi-relation databases with views that are projections of joins of relations: 
this is most important, given that the universal relation assumption is being criticized as 
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unrealistic. Wc also believe that this is likely to be the theoretically most interesting direction. 

(4) Studying the explicit functional dependencies: It seems to us that EFD"% are a step in the 
right direction, if one wants a model capable of capturing the information content of database 
mappings. We have already examined their influence on complementarity of views (Theorem 
10). Their effect on issues like testing translatability or designing a translation (perhaps in 
conjunction with refining our definition to capture more semantics) is a question which we feel 
deserves further research. 
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